# Academic VQA

LLaVA-Gemma-7b is a large multimodal model trained with the LLaVA-v1.5 framework. It uses google/gemma-7b-it as the language backbone together with a CLIP visual encoder, and is suited to multimodal understanding and generation tasks.

Transformers English

Llava V1.5 7b Gguf

LLaVA is an open-source multimodal chatbot built by fine-tuning LLaMA/Vicuna on GPT-generated multimodal instruction-following data.

Llava V1.5 13B AWQ

LLaVA is an open-source multimodal chatbot built by fine-tuning LLaMA/Vicuna on GPT-generated multimodal instruction-following data.
